Abstract

We propose extreme value analogues of natural exponential families and exponential
dispersion models, and introduce the slope function as an analogue of the variance function.
Quadratic and power slope functions characterize well-known families such as the
Rayleigh, Gumbel, power, Pareto, logistic, negative exponential, Weibull and Fréchet. We
show a convergence theorem for slope functions, by which we may express the classical extreme
value convergence results in terms of asymptotics for extreme dispersion models. The main
idea is to explore the parallels between location families and natural exponential families, and
between the convolution and minimum operations.

1 Introduction

In a seminal paper Morris (1982) asked the following question: what do the normal, Poisson, gamma,
binomial, and negative binomial distributions have in common that makes them so special? His answer
was that they are all natural exponential families with quadratic variance functions. This idea has
wide-ranging practical and theoretical ramifications, in particular for generalized linear models
(McCullagh and Nelder, 1989) and exponential dispersion models (Jørgensen, 1987).

Many subsequent authors have used the variance function as a characterization and convergence
tool for natural exponential families and exponential dispersion models, cf. Jørgensen (1997), Casalis
(2000) and references therein. In particular, Tweedie (1984) and several authors independently of him
(Morris, 1981; Hougaard, 1986; Bar-Lev and Enis, 1986) proposed and investigated the class of power
variance functions, corresponding to what we now call the Tweedie class of exponential dispersion models.
Jørgensen et al. (1994) showed that the Tweedie models appear as limits in a class of convergence results
for exponential dispersion models, extending certain classical stable convergence results.

These ideas appear, at first sight, to have little relevance for extreme value theory. Echoing Morris
(1982) we may ask, however, what distributions like the Rayleigh, Gumbel, power, Pareto, logistic and
negative exponential have in common that makes them so special in the context of extremes? Also,
is there an extreme value analogue of power variance functions, perhaps related to the Weibull and
Fréchet distributions? In the present paper we develop an extreme value dispersion model framework
in the spirit of Jørgensen (1997), leading to constructive answers to these questions. In particular we
find a close parallel between the above-mentioned Tweedie convergence results and the classical extreme
convergence results by Fisher and Tippett (1928) and Gnedenko (1943). See Coles (2001), Kotz and
Nadarajah (2002) and Beirlant et al. (2004) for background material on extremes.

In Section 2 we introduce the rate and slope of a distribution as analogues of the mean and variance,
respectively. In Section 3 hazard location families and slope functions are introduced as analogues of
natural exponential families and variance functions, respectively. In Section 4 we introduce extreme
dispersion models as analogues of exponential dispersion models. In Section 5 we classify quadratic slope
functions in a manner similar to Morris' (1982) classification of quadratic variance functions. In Section
6 we draw a parallel between generalized extreme value distributions and Tweedie models, having power
slope functions and power variance functions, respectively. In Section 7 a general convergence result
for slope functions is shown, leading to a new proof of the classical extreme value convergence results,
now set in the extreme dispersion model setting. Finally, in Section 8 we consider characterization and
convergence for exponential slope functions.



2 Basic framework 

We now introduce the basic setup for the paper, define the notions of rate and slope for a real random 
variable Y, and show that they are analogues of the mean and variance, respectively. It is convenient 
to use familiar terms from lifetime analysis such as survival function and hazard function, although Y is 
not restricted to be positive. It is also convenient to use minimum (min) rather than the conventional 
maximum. Results for maxima may be obtained by a reflection in the usual way, see Beirlant et al. (2004, 
p. 46). 

2.1 Survival, hazard and integrated hazard 

We assume that the survival function $G(y) = P(Y > y)$ is twice continuously differentiable on the support
$C = (a, b) \subseteq \mathbb{R}$, continuous at $a$, and possibly discontinuous at $b$. Let $\mathcal{G}$ denote the set of all $G$ such that
the density function $f = -G'$ is strictly positive on $C$, and let $\mathcal{G}_0$ denote the subset of $\mathcal{G}$ for which $0 \in C$.

When $G(b) > 0$ we talk about right censoring at $b$. In particular we allow a positive probability mass
$G(\infty)$ at $b = \infty$, in which case $G$ represents an improper distribution. In survival analysis $G(\infty)$ is the
probability that an individual never experiences the event in question, see e.g. Aalen (1988).

We define the integrated hazard function $H : \mathbb{R} \to [0, \infty]$ and the hazard function $h : \mathbb{R} \to [0, \infty]$
corresponding to $G$ by $H(y) = -\log G(y)$ and $h(y) = H'(y)$, respectively. It is understood that both
$H$ and $h$ are $0$ to the left and $\infty$ to the right of $C$, except that $H(b)$ is finite if $G(b) > 0$. With these
conventions the following relationship holds for all $y \in \mathbb{R}$,

$$H(y) = \int_{-\infty}^{y} h(x)\,dx. \tag{1}$$

2.2 Rate, slope and semiinvariants 

We now define the rate and slope for $Y$, and make the connection with the min operation.

By way of motivation, let us recall the derivation of the mean and variance from the moment generating
function. Let the random variable $Y$ have moment generating function $M(t) = E(e^{tY})$, with domain
$\Theta = \{t \in \mathbb{R} : M(t) < \infty\}$ such that $0 \in \operatorname{int}\Theta$. Consider the cumulant generating function $\kappa = \log M$
whose derivatives at zero $\kappa^{(i)}(0)$ are the cumulants. The mean and variance, in particular, are given by

$$E(Y) = \tau(0) \quad\text{and}\quad \operatorname{Var}(Y) = \tau'(0), \tag{2}$$

where $\tau = \kappa'$ is the mean value mapping, which is strictly increasing on $\operatorname{int}\Theta$.

We now propose $G(y) = E(1_{Y>y})$ as an analogue of the moment generating function $M(t) = E(e^{tY})$,
which in turn makes $H$ and $h$ analogues of the cumulant generating function $\kappa$ and mean value mapping
$\tau$, respectively. By analogy with (2) we define the rate $r$ and slope $s$ for a random variable $Y$ with
survival function $G \in \mathcal{G}_0$ by

$$r(Y) = h(0) \quad\text{and}\quad s(Y) = h'(0),$$

respectively. Unlike the variance, however, the slope may be negative as well as positive, and the rate
decreases (increases) under translation in the IFR (DFR) case. In general we define the $i$th semiinvariant
by $k_i(Y) = H^{(i)}(0)$ for $i \geq 1$, provided the derivatives exist, analogously to the cumulants. This
terminology alludes to T.N. Thiele's name half-invariants for the cumulants, cf. Lauritzen (2002, p. 207).
Letting $\mu = r(Y)$, the slope of $Y$ may be written as follows:

$$s(Y) = \mu\{\mu - g'(0)\}, \tag{3}$$

where $g = -\log f$. This result is somewhat analogous to the result $\operatorname{Var}(Y) = E(Y^2) - E^2(Y)$ for the
variance, see also Section 2.3.
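As a numerical illustration of the definitions of rate and slope and of identity (3), the following minimal sketch approximates $h(0)$, $h'(0)$ and $g'(0)$ by finite differences. The helper names `rate_and_slope` and `g_prime_at_zero`, the step sizes, and the shifted logistic example are assumptions made for illustration only; they are not part of the paper.

```python
import math

def rate_and_slope(G, eps=1e-4):
    """Approximate r(Y) = h(0) and s(Y) = h'(0) by central differences of H = -log G."""
    H = lambda y: -math.log(G(y))
    rate = (H(eps) - H(-eps)) / (2 * eps)
    slope = (H(eps) - 2 * H(0.0) + H(-eps)) / eps ** 2
    return rate, slope

def g_prime_at_zero(G, eps=1e-3):
    """Approximate g'(0) for g = -log f, with the density f = -G' also obtained numerically."""
    f = lambda y: (G(y - eps) - G(y + eps)) / (2 * eps)
    return (math.log(f(-eps)) - math.log(f(eps))) / (2 * eps)

# Hypothetical example: a logistic survival function shifted by one unit.
G = lambda y: 1.0 / (1.0 + math.exp(y + 1.0))

mu, s = rate_and_slope(G)
rhs = mu * (mu - g_prime_at_zero(G))   # right-hand side of (3)
print(mu, s, rhs)
```

Up to discretization error, the slope `s` and the right-hand side `rhs` of (3) should agree.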

Like the cumulants, the semiinvariants satisfy a scale equivariance property $k_i(cY) = c^{-i} k_i(Y)$ for
$c > 0$, which follows from the fact that $cY$ has integrated hazard function $H(y/c)$. In particular the rate
and slope satisfy

$$r(cY) = c^{-1} r(Y) \quad\text{and}\quad s(cY) = c^{-2} s(Y). \tag{4}$$

The rate is not, however, translation equivariant, nor is the slope translation invariant; instead they satisfy
$r(c + Y) = h(-c)$ and $s(c + Y) = h'(-c)$ for $c \in \mathbb{R}$.

The min of $n$ independent variables $Y_i$ has integrated hazard function $H_1(y) + \cdots + H_n(y)$, so the
semiinvariants are additive with respect to the min operation $\wedge$, in much the same way that the cumulants
are additive with respect to convolution. In particular

$$r\Big(\bigwedge_{i=1}^{n} Y_i\Big) = \sum_{i=1}^{n} r(Y_i) \quad\text{and}\quad s\Big(\bigwedge_{i=1}^{n} Y_i\Big) = \sum_{i=1}^{n} s(Y_i). \tag{5}$$

We denote the scaled min of $n$ independent and identically distributed (i.i.d.) variables $Y_i$ by

$$\overline{Y}_n = n \bigwedge_{i=1}^{n} Y_i, \tag{6}$$

which in many ways behaves like the sample mean. Thus, combining (4) and (5) yields

$$r(\overline{Y}_n) = \mu \quad\text{and}\quad s(\overline{Y}_n) = \frac{\varsigma}{n}, \tag{7}$$

where $\mu$ is the rate and $\varsigma$ the slope of $Y_i$. Also, the exponential distribution is invariant under the
transformation (6), behaving like a constant does under averaging. This suggests a law of large numbers
involving the exponential distribution, as we shall now see.
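The behaviour of the rate and slope under the scaled min can be checked numerically. The sketch below is an illustration under stated assumptions: it uses the fact that $n$ times the min of $n$ i.i.d. variables with survival function $G$ has survival function $G(y/n)^n$, takes the Gumbel survival function $\exp(-e^y)$ as an example, and uses hypothetical helper names and a sample size chosen for illustration.

```python
import math

def rate_and_slope(G, eps=1e-4):
    """Approximate h(0) and h'(0) by central differences of H = -log G."""
    H = lambda y: -math.log(G(y))
    return (H(eps) - H(-eps)) / (2 * eps), (H(eps) - 2 * H(0.0) + H(-eps)) / eps ** 2

def scaled_min_survival(G, n):
    """Survival function of n * min(Y_1, ..., Y_n) for i.i.d. Y_i with survival function G."""
    return lambda y: G(y / n) ** n

G = lambda y: math.exp(-math.exp(y))   # Gumbel (for minima)
n = 5                                   # illustrative sample size

mu, s = rate_and_slope(G)
mu_n, s_n = rate_and_slope(scaled_min_survival(G, n))
print(mu, mu_n)      # the rate is preserved
print(s / n, s_n)    # the slope is divided by n
```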

2.3 Exponential distribution 

Let $E_\mu$ denote an exponential variable with rate $\mu > 0$. By a shifted exponential variable we mean $a + E_\mu$
with $a < 0$, whose support includes 0. For such a variable we find

$$r(a + E_\mu) = \mu \quad\text{and}\quad s(a + E_\mu) = 0, \tag{8}$$

parallel to the form for the mean and variance of a constant. In the notation of (3), note that $g'(0) = \mu$
for the variable $a + E_\mu$, so (3) implies that the slope is a signed measure of the deviation of $Y$ from
exponentiality, in much the same way that the variance is a measure of the deviation of $Y$ from being
constant.

Since the exponential distribution hence plays the role of constant in the present setup, it is not
surprising that there is a law of large numbers, as suggested by (7) and (8), that involves convergence to
the exponential distribution. In fact, the scaled min $\overline{Y}_n$, after left truncation at 0, has survival function
given, for $y > 0$, by

$$\frac{G^n(y/n)}{G^n(0)} = \exp\Big\{-y\mu - \frac{\varsigma}{2n} y^2 + o(n^{-1})\Big\} \quad\text{as } n \to \infty, \tag{9}$$

converging to an exponential distribution with rate $\mu$. Here left truncation at 0 means conditioning on
the event $\overline{Y}_n > 0$. The quadratic term suggests a central limit theorem. By removing the term $-y\mu$,
corresponding to an exponential component, and rescaling we obtain for $y > 0$

$$\frac{G^n(y/\sqrt{n})}{G^n(0)} \exp\big(\sqrt{n}\,y\mu\big) = \exp\Big\{-\frac{\varsigma}{2} y^2 + o(1)\Big\} \quad\text{as } n \to \infty. \tag{10}$$



Provided $\varsigma > 0$, this gives an asymptotic Rayleigh distribution, which hence plays the role of the normal
distribution in the present setup. Remark 5.1 below makes precise the idea of removing an exponential
component.
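The two limits can be illustrated numerically on the log scale. The following sketch assumes the Gumbel generator $\log G(y) = -e^y$, for which $\mu = \varsigma = 1$; the function names and the chosen values of $n$ and $y$ are hypothetical. The first quantity is the logarithm of the left-hand side of (9) and should approach $-y\mu$; the second adds back $\sqrt{n}\,y\mu$ after rescaling by $\sqrt{n}$, as in (10), and should approach $-\varsigma y^2/2$.

```python
import math

def log_truncated_scaled_min(logG, n, y):
    """log of G^n(y/n) / G^n(0), the survival function in (9)."""
    return n * (logG(y / n) - logG(0.0))

def log_rayleigh_limit(logG, mu, n, y):
    """log of G^n(y/sqrt(n)) / G^n(0) plus sqrt(n) * y * mu (exponential component removed)."""
    root = math.sqrt(n)
    return n * (logG(y / root) - logG(0.0)) + root * y * mu

logG = lambda y: -math.exp(y)   # Gumbel: rate 1 and slope 1 at zero
y = 1.0
for n in (10, 1000, 100000):
    print(n, log_truncated_scaled_min(logG, n, y), log_rayleigh_limit(logG, 1.0, n, y))
```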



3 Hazard location families 
3.1 Motivation 

We now introduce hazard location families, and show that each such family is characterized by its slope 
function, just like a natural exponential family is characterized by its variance function. 

Recall that the variance function is defined on $\Omega = \tau(\operatorname{int}\Theta)$ by $V(\mu) = \tau'(\tau^{-1}(\mu))$, where $\tau$ is the
mean value mapping defined in connection with (2). As pointed out by Morris (1982), $V$ characterizes
the distribution of $Y$ up to an exponential tilting. This follows since given $V$, the inverse $\tau^{-1}$ satisfies
the differential equation

$$\frac{d\tau^{-1}(\mu)}{d\mu} = \frac{1}{V(\mu)}, \tag{11}$$

from which $\tau^{-1}(\mu)$ may be recovered up to an additive constant $-\theta$, say, corresponding to an exponential
tilting of the density $f$ (not necessarily with respect to Lebesgue measure)

$$f(y; \theta) = f(y)\exp\{y\theta - \kappa(\theta)\}. \tag{12}$$

This is a natural exponential family (NEF), and (12) has mean $\mu = \tau(\theta)$ and variance $V(\mu)$, cf. Jørgensen
(1997, Ch. 2).



3.2 Definition 

Our analogy implies that the hazard function $h$ should be analogous to the mean value mapping $\tau$. In
order to make the analogy complete, however, we need $h$, like $\tau$, to be monotone. We thus consider from
now on survival functions in $\mathcal{G}$ with monotone hazard rate, in the sense that $h$ is strictly monotone on $C$,
either increasing (IFR) or decreasing (DFR). This subset of $\mathcal{G}$ is denoted $\overline{\mathcal{G}}$, and we let $\overline{\mathcal{G}}_0 = \overline{\mathcal{G}} \cap \mathcal{G}_0$.

Remark 3.1 In the DFR case it is necessary that $a > -\infty$ in order for the integral (1) to converge at
$a$. In the IFR case $G$ is always proper, since if $h$ is increasing on $(a, \infty)$ then the integral (1) diverges at
$\infty$.

Table 1: The main quadratic hazard slope families and associated NEFs. GHS is the generalized
hyperbolic secant family.

HL($\mu$) | $G(y)$ | $C$ | $v(\mu)$ | $\Psi$ | NEF
Rayleigh | $\exp(-y^2/2)$ | $\mathbb{R}_+$ | $1$ | $\mathbb{R}_+$ | Normal
Gumbel | $\exp(-e^{y})$ | $\mathbb{R}$ | $\mu$ | $\mathbb{R}_+$ | Poisson
Uniform | $1 - y$ | $(0, 1)$ | $\mu^2$ | $(1, \infty)$ | Gamma
Pareto | $y^{-1}$ | $(1, \infty)$ | $-\mu^2$ | $(0, 1)$ |
Logistic | $(1 + e^{y})^{-1}$ | $\mathbb{R}$ | $\mu(1 - \mu)$ | $(0, 1)$ | Binomial
Neg. exponential | $1 - e^{y}$ | $\mathbb{R}_-$ | $\mu(1 + \mu)$ | $\mathbb{R}_+$ | Neg. binomial
Cosine | $\cos y$ | $(0, \pi/2)$ | $1 + \mu^2$ | $\mathbb{R}_+$ | GHS


Remark 3.2 Consider the case $Y = \log T$, where $T$ is a positive survival time, say. Since then $C = \mathbb{R}$,
making $a = -\infty$, only IFR is possible. The derivative of the hazard function for $T$ is

$$\frac{h'(\log t) - h(\log t)}{t^2}.$$

Hence $T$ need not have monotone hazard rate, even though $Y$ does. In this sense, the assumption of
monotone hazard rate is less of a restriction when modelling log survival times.

We now define an analogue of the variance function. For given $G \in \overline{\mathcal{G}}$ we let $\Psi = h(C)$ (an open
interval), and define the slope function $v : \Psi \to \mathbb{R}_\pm$ by

$$v(\mu) = h'(h^{-1}(\mu)). \tag{13}$$

Here $v$ maps into $\mathbb{R}_+$ in the IFR case and into $\mathbb{R}_-$ in the DFR case. Analogously to (11) we find that
the inverse hazard function $h^{-1}$ satisfies

$$\frac{dh^{-1}(\mu)}{d\mu} = \frac{1}{v(\mu)}. \tag{14}$$



Proposition 3.1 The slope function $v$ with domain $\Psi$ characterizes the location family $G(\cdot - \theta)$ with
$\theta \in \mathbb{R}$ among all location families within $\overline{\mathcal{G}}$.

Proof: Given $v$, the solution to (14) is $\theta + h^{-1}(\mu)$, where $\theta \in \mathbb{R}$ is arbitrary. By inversion, we obtain
the hazard function $h(\cdot - \theta)$ corresponding to the location family $G(\cdot - \theta)$. ■

To complete the analogy with natural exponential families we restrict the domain of $\theta$ to $-C$, such
that $G(\cdot - \theta) \in \overline{\mathcal{G}}_0$. Note that the rate and slope for $G(\cdot - \theta)$ are $\mu = h(-\theta)$ and $h'(-\theta) = v(\mu)$. This
leads to the following definition.

Definition 3.1 The hazard location family $\{\mathrm{HL}(\mu) : \mu \in \Psi\} \subseteq \overline{\mathcal{G}}$ generated from $G \in \overline{\mathcal{G}}$ is defined by
the family of survival functions with support $C - h^{-1}(\mu)$ given by

$$y \mapsto G\{y + h^{-1}(\mu)\}. \tag{15}$$



Note that the definition of $v$ in (13) is independent of the representation (15) used for the family, so
the slope function represents an intrinsic property of the family.

Table 1 shows some examples of hazard location families corresponding to familiar distributions, all
with quadratic slope functions (polynomials of degree at most two), to be studied in Section 5. Except
for the Pareto distribution, all the families in the table are IFR. These six IFR families have the same
functional form for $v$ as the variance functions for Morris' (1982) six natural exponential families. In
particular, the Rayleigh distribution has constant slope function, like the variance function of the normal
distribution.
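The slope functions of the seven families in Table 1 can be checked numerically from definition (13): at any point $y$ of the support, $\mu = h(y)$ and $v(\mu) = h'(y)$. The sketch below is illustrative only; the evaluation points, step size and helper names are assumptions, and the derivatives are approximated by finite differences of $H = -\log G$.

```python
import math

def hazard_and_derivative(G, y, eps=1e-4):
    """Approximate h(y) and h'(y) by central differences of H = -log G."""
    H = lambda t: -math.log(G(t))
    h = (H(y + eps) - H(y - eps)) / (2 * eps)
    dh = (H(y + eps) - 2 * H(y) + H(y - eps)) / eps ** 2
    return h, dh

# name: (survival function G, slope function v, an interior point of the support)
families = {
    "Rayleigh": (lambda y: math.exp(-y * y / 2), lambda m: 1.0, 0.7),
    "Gumbel": (lambda y: math.exp(-math.exp(y)), lambda m: m, 0.3),
    "Uniform": (lambda y: 1.0 - y, lambda m: m * m, 0.4),
    "Pareto": (lambda y: 1.0 / y, lambda m: -m * m, 2.0),
    "Logistic": (lambda y: 1.0 / (1.0 + math.exp(y)), lambda m: m * (1.0 - m), -0.5),
    "Neg. exponential": (lambda y: 1.0 - math.exp(y), lambda m: m * (1.0 + m), -1.0),
    "Cosine": (math.cos, lambda m: 1.0 + m * m, 0.6),
}

errors = {}
for name, (G, v, y) in families.items():
    mu, dh = hazard_and_derivative(G, y)
    errors[name] = abs(dh - v(mu))   # discrepancy between h'(y) and v(h(y))
    print(name, mu, dh, v(mu))
```

The sign of `dh` also shows which families are IFR (positive) and which are DFR (negative).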
3.3 Truncation and censoring

We now study the effect on v of transformations like truncation and censoring. 

Left truncation at some point $c \in C$ gives rise to a new hazard location model with $v$ restricted to a
subset of $\Psi$. Similarly, the operation of right censoring at some point $c \in C$ corresponds to replacing $Y$
by $Y \wedge c$, also known as the limited loss variable in loss modelling (Klugman et al., 2004, p. 30). This
operation reduces $C$ to the subset $(a, c)$ and introduces the probability $G(c)$ in $c$. We summarize these
considerations in a lemma.

Lemma 3.2 Left truncation at $c \in C$ corresponds to restricting the domain of $v$ to the interval between
$h(c)$ and $h(b)$. Right censoring at $c \in C$ corresponds to restricting the domain of $v$ to the interval between
$h(a)$ and $h(c)$.

In the DFR case, left truncation thus results in the domain $\Psi = (h(b), h(c))$, whereas right censoring
gives the domain $(h(c), h(a))$. The lemma shows that the restriction of $v$ to a subinterval of its domain
is again the slope function for a hazard location family. When looking for a model corresponding to a
given functional form for $v$, we may hence concentrate on the largest possible domain consistent with a
survival function in $\overline{\mathcal{G}}_0$. Note, however, that restricting the domain of $v$ to a subset of $\Psi$ implies a change
in the distributional form, because the support is changed. By comparison, restricting to a subinterval
of $\Omega$ in a natural exponential family selects a subset of the family of distributions, without changing the
distributions as such.



4 Extreme dispersion models 
4.1 Motivation 

We now introduce extreme dispersion models as a parallel to exponential dispersion models, and show 
that they satisfy a reproductive property. 

Given a natural exponential family (12) with variance function $V(\mu)$, the corresponding exponential
dispersion model $\mathrm{ED}(\mu, \lambda)$ consists of natural exponential families with variance function $\lambda^{-1}V(\mu)$
proportional to $V(\mu)$. The latter is then called the unit variance function. The model $\mathrm{ED}(\mu, \lambda)$ has density
function of the form

$$f(y; \theta, \lambda) = f_\lambda(y)\exp[\lambda\{y\theta - \kappa(\theta)\}],$$

for a suitable function $f_\lambda$. Here $\mu = \tau(\theta)$ is the mean, in the notation of (2), and $\sigma^2 = 1/\lambda$ is the
dispersion parameter. The index parameter $\lambda$ has domain $\Lambda \subseteq \mathbb{R}_+$, which is an additive semigroup (often
$\mathbb{R}_+$ or $\mathbb{N}$).

$\mathrm{ED}(\mu, \lambda)$ satisfies the following mean reproductive property. The average of $n$ i.i.d. variables $Y_1, \ldots, Y_n$
from $\mathrm{ED}(\mu, \lambda)$ has distribution

$$\overline{Y}_n \sim \mathrm{ED}(\mu, n\lambda), \tag{16}$$

where the index parameter is proportional to the sample size. This follows from the form of the moment
generating function of $\mathrm{ED}(\mu, \lambda)$, which is

$$\frac{M^{\lambda}(t/\lambda + \tau^{-1}(\mu))}{M^{\lambda}(\tau^{-1}(\mu))}, \tag{17}$$

where $M(\cdot)$ is the moment generating function for $f = f_1$, cf. Jørgensen (1997, Ch. 3).
4.2 Definition



Definition 4.1 Given a survival function $G \in \overline{\mathcal{G}}$ with hazard function $h$ and support $C$, we define the
extreme dispersion model generated by $G$, denoted $\mathrm{XD}(\mu, \lambda)$, as the family of survival functions in $\overline{\mathcal{G}}_0$
given by

$$y \mapsto G^{\lambda}(y/\lambda + h^{-1}(\mu)) \tag{18}$$

with rate $\mu \in \Psi$, index parameter $\lambda > 0$ and support $\lambda(C - h^{-1}(\mu))$.

It is straightforward to see that the model $\mathrm{XD}(\mu, \lambda)$ may be generated from any of its members in
this way, up to a rescaling of $\lambda$ to $c\lambda$ for some $c > 0$. In the following we work with the representation
(18) corresponding to a specific choice for $G$. The corresponding hazard and density functions are for
$y \in \lambda(C - h^{-1}(\mu))$ given by

$$h(y; \mu, \lambda) = h(y/\lambda + h^{-1}(\mu)) \tag{19}$$

and

$$f(y; \mu, \lambda) = h(y/\lambda + h^{-1}(\mu)) \exp[-\lambda H\{y/\lambda + h^{-1}(\mu)\}],$$

respectively. The right extreme of the support is $\lambda(b - h^{-1}(\mu))$, which has probability $G^{\lambda}(b)$.

The parameter $\mu$ is the rate for $\mathrm{XD}(\mu, \lambda)$ for any value of $\lambda > 0$, which follows from (19) by inserting
$y = 0$, in much the same way that $\mu$ is the mean of $\mathrm{ED}(\mu, \lambda)$ for all $\lambda$. For each fixed value of $\lambda$, (18)
corresponds to a hazard location model with slope function $\lambda^{-1}v(\mu)$. Hence $v$ is called the unit slope
function for $\mathrm{XD}(\mu, \lambda)$, and by Proposition 3.1, $v$ characterizes $\mathrm{XD}(\mu, \lambda)$ up to a rescaling of $\lambda$ like above.
We call $\sigma^2 = 1/\lambda$ the dispersion parameter.

The following min reproductive property easily follows from the form of the survival function (18).
For i.i.d. variables $Y_1, \ldots, Y_n \sim \mathrm{XD}(\mu, \lambda)$ the scaled min $\overline{Y}_n$ from (6) has distribution

$$\overline{Y}_n \sim \mathrm{XD}(\mu, n\lambda). \tag{20}$$

This is analogous to the mean reproductive property (16) for exponential dispersion models. It is
equivalent to the max-stable property of an exponentiated family of distributions (Nelson and Doganaksoy,
1995; Sarabia and Castillo, 2005; Nadarajah and Kotz, 2006), but the present formulation emphasizes
the fact that the rate is preserved under the scaled min operation. In (20), like (16), the index parameter
is proportional to the sample size.
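The min reproductive property can be verified directly from the survival function (18): the scaled min of $n$ i.i.d. variables with survival function $S$ has survival function $S(y/n)^n$. The following sketch assumes the logistic generator of Table 1, with $h^{-1}(\mu) = \log\{\mu/(1-\mu)\}$; the function name `xd_survival` and the parameter values are hypothetical choices for illustration.

```python
import math

def xd_survival(G, h_inv, mu, lam):
    """Survival function y -> G(y/lam + h_inv(mu))**lam of XD(mu, lam), as in (18)."""
    return lambda y: G(y / lam + h_inv(mu)) ** lam

G = lambda y: 1.0 / (1.0 + math.exp(y))          # logistic generator
h_inv = lambda mu: math.log(mu / (1.0 - mu))     # inverse of h(y) = e^y / (1 + e^y)

mu, lam, n = 0.3, 2.0, 4                         # illustrative values
single = xd_survival(G, h_inv, mu, lam)
scaled_min = lambda y: single(y / n) ** n        # survival function of n * min(Y_1, ..., Y_n)
target = xd_survival(G, h_inv, mu, n * lam)      # XD(mu, n * lam)

max_gap = max(abs(scaled_min(y) - target(y)) for y in (-3.0, -1.0, 0.0, 0.5, 2.0))
print(max_gap)
```

The two survival functions agree up to floating-point rounding, since the identity is algebraic rather than asymptotic.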

Let us consider the $\mathrm{XD}(\mu, \lambda)$ models generated from the first six cases in Table 1, where in fact the
introduction of the index parameter $\lambda$ corresponds to known generalizations. In the Rayleigh and Gumbel
cases, this adds a scale or location parameter to the models, respectively. The uniform distribution
becomes a shifted power distribution. The Pareto becomes a shifted generalized Pareto distribution.
The logistic becomes a generalized logistic distribution. The negative exponential becomes the negative
exponentiated exponential, see Nadarajah and Kotz (2006).

We note in passing the well-known fact that a transformation of the variable $Y \sim \mathrm{XD}(\mu, \lambda)$ to the
cumulated hazard scale $H(Y/\lambda + h^{-1}(\mu))$ gives an exponential variable with parameter $\lambda$, possibly right
censored at the point $H(b)$.

4.3 Frailty models 

The study of frailty models reveals a certain intimate connection between extreme and exponential
dispersion models. Let the conditional distribution $Y \mid X = x$ be exponential with parameter $x$, and let $X$
be a non-negative random variable with moment generating function $M(t)$. Then the marginal survival
function for $Y$ is $M(-y)$ for $y > 0$, which is in effect the frailty model of Vaupel et al. (1979).
In the special case where $X \sim \mathrm{ED}(\mu, \lambda)$ with moment generating function (17) we obtain the following
survival function for $Y$ (Hougaard, 1986):

$$\frac{M^{\lambda}(-y/\lambda + \tau^{-1}(\mu))}{M^{\lambda}(\tau^{-1}(\mu))}.$$

This is an extreme dispersion model with hazard function $h(y) = \tau(-y)$, and corresponding unit slope
function $v(\mu) = -V(\mu)$, the negative of the unit variance function for $X$. This model is hence DFR.
An example is the generalized Pareto distribution, which is the frailty model corresponding to a gamma
frailty, with slope function $-\mu^2$ for $\mu \in (0, 1)$. As this example illustrates, the domain for $v$ may be a
proper subset of that for $V$.

In the particular case where $X$ has a positive probability at 0, the distribution of $Y$ becomes improper
with $P(Y = \infty) = P(X = 0) > 0$. When $X$ follows the Tweedie compound Poisson distribution with
$1 < p < 2$ (cf. Jørgensen, 1997, Ch. 4), which has a positive probability at 0, an improper distribution
for $Y$ is obtained, as pointed out by Aalen (1988).

4.4 Exponential convergence 

We shall now return to the exponential convergence of Section 2.3. By way of motivation, note that an
exponential dispersion variable $Y \sim \mathrm{ED}(\mu, \lambda)$ converges in probability to $\mu$ as $\lambda \to \infty$, as is clear
from (16). The analogous result for extreme dispersion models involves convergence to the exponential
distribution.

Proposition 4.1 For $Y \sim \mathrm{XD}(\mu, \lambda)$ and $c \in \mathbb{R}$ the conditional distribution of $Y - c$ given $Y > c$ is
asymptotically exponential with rate $\mu$ for $\lambda \to \infty$.

Proof: Let $\lambda$ be large enough to make the support $\lambda(C - h^{-1}(\mu))$ contain $c$. Then the conditional
survival function of $Y - c$ given $Y > c$ is, for $y > 0$,

$$G_c(y; \mu, \lambda) = \frac{G^{\lambda}((c + y)/\lambda + h^{-1}(\mu))}{G^{\lambda}(c/\lambda + h^{-1}(\mu))}.$$

A Taylor expansion of $H$ around $c/\lambda + h^{-1}(\mu)$ gives

$$G_c(y; \mu, \lambda) = \exp\Big[-y\,h\{c/\lambda + h^{-1}(\mu)\} - \frac{y^2}{2\lambda}\,h'\{c_{\lambda y} + h^{-1}(\mu)\}\Big],$$

where $c_{\lambda y}$ is between $c/\lambda$ and $(c + y)/\lambda$. Letting $\lambda \to \infty$ and using the continuity of $h$ and $h'$, we obtain
the desired result. ■
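The convergence in Proposition 4.1 can be observed numerically by evaluating the logarithm of the conditional survival function for increasing $\lambda$. The sketch below assumes the logistic generator, for which $\log G(y) = -\log(1 + e^y)$ and $h^{-1}(\mu) = \log\{\mu/(1-\mu)\}$; the function name and the values of $\mu$, $c$ and $y$ are hypothetical.

```python
import math

def log_conditional_survival(logG, h_inv, mu, lam, c, y):
    """log P(Y - c > y | Y > c) for Y ~ XD(mu, lam), using the survival function (18)."""
    theta = h_inv(mu)
    return lam * (logG((c + y) / lam + theta) - logG(c / lam + theta))

logG = lambda y: -math.log1p(math.exp(y))        # logistic generator
h_inv = lambda mu: math.log(mu / (1.0 - mu))

mu, c, y = 0.3, 1.0, 2.0
for lam in (1.0, 100.0, 10000.0):
    # second column approaches the exponential limit -mu * y in the third column
    print(lam, log_conditional_survival(logG, h_inv, mu, lam, c, y), -mu * y)
```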



5 Quadratic slope functions 

We now follow Morris' (1982) footsteps and classify the set of quadratic slope functions. To this end, we 
need to study reflections of slope functions, and the role of exponential components. These transforma- 
tions have a somewhat formal nature, but turn out to be useful for the classification result. For the sake 
of brevity, certain details in this section are left to the reader. 
Table 2: Some vertically reflected quadratic hazard slopes.

HL($\mu$) | $G(y)$ | $C$ | $v(\mu)$ | $\Psi$
Reflected Gumbel | $\exp(1 - y - e^{-y})$ | $\mathbb{R}_+$ | $1 - \mu$ | $(0, 1)$
Reflected logistic | $1/\cosh(y/2)$ | $\mathbb{R}_+$ | $(\tfrac{1}{2} - \mu)(\tfrac{1}{2} + \mu)$ | $(0, \tfrac{1}{2})$
Reflected neg. exponential | $4(e^{-y} - e^{-2y})$ | $(\log 2, \infty)$ | $(\mu - 1)(\mu - 2)$ | $(0, 1)$

5.1 Reflections 

We now consider what happens when we subject v to a horizontal or vertical reflection. 

Proposition 5.1 Let $G \in \overline{\mathcal{G}}_0$ have support $C = (a, b)$ and slope function $v$. Horizontal reflection: If $G$ is
right censored, then the survival function $y \mapsto G(b)/G(-y)$ with support $-C$ has hazard function $h(-y)$
and slope function $-v$ on $\Psi$, and is also right censored. Vertical reflection: Assume that $(0, m) \subseteq \Psi$ and
restrict the support to the interval $(a_0, b_0)$, either $(a, h^{-1}(m))$ (IFR case) or $(h^{-1}(m), b)$ (DFR case). If $G$
is right censored at $b_0 < \infty$ then the survival function $y \mapsto G(-y)/G(b_0)\exp\{-m(y + b_0)\}$ with support
$(-b_0, -a_0)$ has slope function $\mu \mapsto v(m - \mu)$ with domain $(0, m)$, and is right censored if $a_0 > -\infty$.

Proof: Horizontal reflection: The survival function $G(b)/G(-y)$ with support $-C$ is easily seen to have
hazard function $h(-y)$ and slope function $-v(\mu)$ on $\Psi$. The value at the right endpoint is $G(b)/G(a) > 0$,
so the model is right censored. Vertical reflection: The survival function $G(-y)/G(b_0)\exp\{-m(y + b_0)\}$
with support $(-b_0, -a_0)$ is similarly seen to have hazard function $m - h(-y)$ and slope function $v(m - \mu)$
on $(0, m)$. If $a_0 > -\infty$ then $G(a_0)/G(b_0)\exp\{-m(b_0 - a_0)\} > 0$, so in this case the model is right
censored. ■

Table 2 shows three hazard location families with quadratic slope functions obtained by vertical
reflection of families from Table 1.
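As a check of the vertical reflection in Proposition 5.1, the sketch below applies it to the negative exponential family of Table 1 with $m = 1$, so that $b_0 = h^{-1}(1) = -\log 2$, and compares the numerically differentiated hazard of the reflected survival function with $v(m - \mu)$. The helper names, the evaluation point and the step size are assumptions made for illustration.

```python
import math

def vertical_reflection(G, m, b0):
    """Survival function y -> G(-y) / G(b0) * exp(-m * (y + b0)) from Proposition 5.1."""
    return lambda y: G(-y) / G(b0) * math.exp(-m * (y + b0))

def hazard_and_derivative(G, y, eps=1e-4):
    """Approximate h(y) and h'(y) by central differences of H = -log G."""
    H = lambda t: -math.log(G(t))
    h = (H(y + eps) - H(y - eps)) / (2 * eps)
    dh = (H(y + eps) - 2 * H(y) + H(y - eps)) / eps ** 2
    return h, dh

# Negative exponential: G(y) = 1 - e^y on y < 0, with slope function v(mu) = mu * (1 + mu).
G = lambda y: 1.0 - math.exp(y)
v = lambda mu: mu * (1.0 + mu)
m, b0 = 1.0, -math.log(2.0)

R = vertical_reflection(G, m, b0)
y = 1.5                                  # a point in the reflected support (log 2, infinity)
h, dh = hazard_and_derivative(R, y)
closed_form = 4.0 * (math.exp(-y) - math.exp(-2.0 * y))
print(h, dh, v(m - h), R(y), closed_form)
```

Here `dh` should match `v(m - h)`, which equals $(\mu - 1)(\mu - 2)$ at $\mu$ = `h`, and `R(y)` should match the closed form $4(e^{-y} - e^{-2y})$.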

5.2 Exponential components 

Extending the results of Section 2.3 we now show that an exponential component (in the sense of Remark
5.1 below) corresponds to a location change for the slope function.

Proposition 5.2 Let $G \in \overline{\mathcal{G}}_0$ with support $C = (a, b)$ have slope function $v$ with domain $\Psi = (\underline{\eta}, \overline{\eta})$. If
$a > -\infty$ then for $m \geq -\underline{\eta}$, the function $v(\mu - m)$ with domain $m + \Psi$ is the slope function of the survival
function given by $G(y)\exp\{-m(y - a)\}$ on $C$.

Proof: The survival function $G(y)\exp\{-m(y - a)\}$ has integrated hazard function

$$m(y - a) + H(y) \tag{21}$$

on $C$, provided $m \geq -\underline{\eta}$, with hazard function $m + h(y)$ and slope function $v(\mu - m)$ on $m + \Psi$. ■

Remark 5.1 A positive $m$ in (21) corresponds to the variable $\min\{Y, a + E_m\}$, where the exponential
variable $E_m$ is independent of $Y$. In this case we say that we are introducing an exponential component.
Conversely, when $m = -\underline{\eta} < 0$ we say that we are removing the exponential component, making $\inf \Psi = 0$.
The only model in Table 1 with an exponential component is the uniform distribution. After removing
the exponential component, we obtain

$$G(y) = e^{y}(1 - y) \quad\text{for } y \in (0, 1). \tag{22}$$

The corresponding slope function is $(1 + \mu)^2$ with domain $\Psi = \mathbb{R}_+$.

Note that, using the terminology of Remark 5.1, it is understood in connection with the classification
results below that vertical reflection (Proposition 5.1) is applied only after removing the exponential
component, if necessary, to ensure that $\inf \Psi = 0$.

An example of an exponential component is encountered in connection with the Gumbel family with
unit slope function $v(\mu) = \mu$ on $\Psi = \mathbb{R}_+$. The Gompertz-Makeham distribution is obtained from the
Gumbel by left truncation at 0, restricting $v$ to $\mu > 1$, and then adding an exponential component. This
gives the hazard function $h(y) = m + e^{\beta y}$ for $m, \beta, y > 0$ and unit slope function $v(\mu) = \mu - m$ for
$\mu > 1 + m$. A horizontal reflection, corresponding to $\beta < 0$, yields $v(\mu) = m - \mu$ for $\mu \in (m, m + 1)$.



5.3 Classification 

We have now considered several types of transformations of slope functions, including censoring,
truncation and reflections. In addition to these, we consider the following three transformations of a given
hazard location family $\mathrm{HL}(\mu)$ with slope function $v$ and domain $\Psi$.

1. Location change: removing or adding an exponential component maps $v$ into $v(\mu - m)$.

2. Scale transformation: a scale transformation of $Y$ maps $v$ into $c^{-2}v(c\mu)$ for $c > 0$.

3. Multiplication: generating an extreme dispersion model maps $v$ into $v/\lambda$ for $\lambda > 0$.

A combination of these three operations maps $v$ into

$$\gamma\,v((\mu - \alpha)/\beta), \tag{23}$$

where $\gamma, \beta > 0$ and $\mu \in \alpha + \beta\Psi$. We refer to (23) as the operation of location and scaling. This leads us
to the main classification theorem.



Theorem 5.3 Up to left truncation, right censoring, reflection, location and scaling, the only hazard
location models with quadratic slope functions are those shown in Table 1.

The next remark will be useful for the proof.

Remark 5.2 Consider $G \in \overline{\mathcal{G}}$ and $c, d \in C$. The following identity

$$H(d) - H(c) = \int_{h(c)}^{h(d)} \frac{\mu}{v(\mu)}\,d\mu \tag{24}$$

follows by the substitution $\mu = h(x)$ in the integral (1). It is useful for checking if a given function $v$
may serve as a slope function. By taking $c = a$, we find that the continuity of $G$ at $a$ is equivalent to the
integral (24) being convergent at $h(a)$. By taking $d = b$, we find that right censoring is equivalent to the
integral being convergent at $h(b)$.
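Identity (24) lends itself to a direct numerical check. The following sketch, offered as an illustration only, uses the logistic family, where $H(y) = \log(1 + e^y)$, $h(y) = e^y/(1 + e^y)$ and $v(\mu) = \mu(1 - \mu)$, and evaluates the right-hand side with a composite Simpson rule. The quadrature routine, its number of subintervals and the endpoints $c$, $d$ are assumptions chosen for the example.

```python
import math

def simpson(func, lo, hi, n=1000):
    """Composite Simpson rule with n (even) subintervals."""
    step = (hi - lo) / n
    total = func(lo) + func(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(lo + i * step)
    return total * step / 3

# Logistic family from Table 1.
H = lambda y: math.log1p(math.exp(y))
h = lambda y: math.exp(y) / (1.0 + math.exp(y))
v = lambda mu: mu * (1.0 - mu)

c, d = -1.0, 2.0
lhs = H(d) - H(c)                                   # left-hand side of (24)
rhs = simpson(lambda mu: mu / v(mu), h(c), h(d))    # right-hand side of (24)
print(lhs, rhs)
```

Letting the upper limit approach 1 makes the integrand $1/(1 - \mu)$ non-integrable, in line with the remark that divergence at $h(b)$ corresponds to the absence of right censoring.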



Proof: [of Theorem 5.3] By means of the location and scaling operation we may reduce the classification
problem to quadratic slope functions with simple forms like in Table 1, having roots either $\pm 1$, $0$ or $i$.
A combination of vertical and horizontal reflections applied to the seven cases of Table 1 then covers
all possible shapes of quadratic slope functions, most of which are right censored (cf. Proposition 5.1).
Regarding the uniform and Pareto distributions, an application of Remark 5.2 shows that neither $\mu^2$ nor
$-\mu^2$ can be slope functions on $\Psi = \mathbb{R}_+$, but only on a subset of $\mathbb{R}_+$. It follows that horizontal reflections
of the uniform and Pareto distributions give rise to two separate cases of right censored slope functions
of the form $\pm\mu^2$, and a further four cases of the form $(\mu - m)^2$ that follow by vertical reflection. It is
easily seen that this covers all possible cases. ■
Table 3: Summary of generalized extreme value distributions.

$\mathrm{EV}_\gamma(\mu, \lambda)$ | $\gamma$ | $p$ | Support
Weibull | $\gamma < -1$ | $p > 2$ | $(1/\gamma, \infty)$
Exponential | $\gamma = -1$ | | $(-1, \infty)$
Weibull | $-1 < \gamma < 0$ | $p < 1$ | $(1/\gamma, \infty)$
Gumbel | $\gamma = 0$ | $p = 1$ | $\mathbb{R}$
Fréchet | $\gamma > 0$ | $1 < p < 2$ | $(-\infty, 1/\gamma)$


Remark 5.3 There are three cases with $\inf C = -\infty$ in Table 1 where a vertical reflection leads to models
that are not right censored, of which typical examples are shown in Table 2.

One could also explore parallels of other classification results for NEFs, such as Letac and Mora's 
(1990) cubic variance functions, but this is outside the scope of the present paper. 



6 Generalized extreme value distributions 

We now investigate the analogy between the generalized extreme value distribution and the Tweedie
class of exponential dispersion models. The latter is characterized by having unit variance functions of
power form $V(\mu) = \mu^p$, cf. Jørgensen (1997, Ch. 4) and references therein. Here $p = 0$ corresponds
to the normal distribution with domain $\mathbb{R}$, whereas the remaining cases, namely $p < 0$ and $p \geq 1$, all
have domain $\mathbb{R}_+$. The special cases $p = 0, 1, 2$ all appear in Table 1, and a further simple case is $p = 3$,
corresponding to the inverse Gaussian distribution.

6.1 Definition 

The standard generalized extreme value distribution for minima is defined by

$$G(y) = \exp\{-(1 - \gamma y)^{-1/\gamma}\},$$

with support defined by $\gamma y < 1$. Here $\gamma \in \mathbb{R}$, and the value $\gamma = 0$ (defined by continuity) corresponds to the
Gumbel distribution. All extreme value distributions except the exponential ($\gamma = -1$) have monotone
hazard rates and their slope functions are of power form

$$v(\mu) = \frac{1}{2 - p}\,\mu^{p} \tag{25}$$

for $\mu > 0$, where the parameter $p \in \mathbb{R} \setminus \{2\}$ is defined by

$$p = p(\gamma) = 1 + \frac{\gamma}{1 + \gamma}. \tag{26}$$

The models are IFR for $p < 2$ ($\gamma > -1$) and DFR for $p > 2$ ($\gamma < -1$). As we saw in the proof of Theorem
5.3, there is no slope function on $\mathbb{R}_+$ proportional to $\mu^2$. Table 3 summarizes the main cases of generalized
extreme value distributions corresponding to different values of $p$.

Introducing location and index parameters, the generalized extreme value distributions are seen to be
examples of extreme dispersion models, one for each $\gamma$. We thus define $\mathrm{EV}_\gamma(\mu, \lambda)$ to be the extreme
dispersion model given by the survival function

\[ y \mapsto \exp\left\{ -\lambda \left( \mu^{-\gamma/(1+\gamma)} - \gamma y/\lambda \right)^{-1/\gamma} \right\}, \]

with support defined by $\gamma y < \lambda \mu^{-\gamma/(1+\gamma)}$, which is a reparametrization of the usual extreme value
distribution. The $\mathrm{EV}_\gamma(\mu, \lambda)$ model satisfies the following scaling property

\[ c\,\mathrm{EV}_\gamma(\mu, \lambda) = \mathrm{EV}_\gamma(c^{-1}\mu, c^{2-p}\lambda). \tag{27} \]
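
The scaling property (27) can be checked numerically. The sketch below assumes that the survival function of $\mathrm{EV}_\gamma(\mu, \lambda)$ reads $\exp\{-\lambda(\mu^{-\gamma/(1+\gamma)} - \gamma y/\lambda)^{-1/\gamma}\}$, with the sign of the exponent of $\mu$ chosen so that the hazard at $y = 0$ equals the rate $\mu$; the function names and test values are hypothetical.

```python
import math

def ev_survival(y, mu, lam, gamma):
    # Assumed form exp{-lam * (mu^(-gamma/(1+gamma)) - gamma*y/lam)^(-1/gamma)};
    # the exponent sign is chosen so that the hazard at y = 0 equals mu.
    base = mu ** (-gamma / (1.0 + gamma)) - gamma * y / lam
    return math.exp(-lam * base ** (-1.0 / gamma))

def scaling_gap(c, mu, lam, gamma, ys=(0.0, 0.5, 1.0)):
    # The survival function of c*Y at y is S(y/c); (27) says this equals the
    # EV survival function with rate mu/c and index c^(2-p)*lam.
    p = 1.0 + gamma / (1.0 + gamma)
    return max(
        abs(ev_survival(y / c, mu, lam, gamma)
            - ev_survival(y, mu / c, c ** (2.0 - p) * lam, gamma))
        for y in ys
    )

def hazard_at_zero(mu, lam, gamma, eps=1e-6):
    # h(0) = -d/dy log S(y) at y = 0, by central difference.
    def log_s(y):
        return math.log(ev_survival(y, mu, lam, gamma))
    return -(log_s(eps) - log_s(-eps)) / (2 * eps)

for gamma in (1.0, -0.5, -2.0):
    print(gamma, scaling_gap(1.7, 2.0, 3.0, gamma), hazard_at_zero(2.0, 3.0, gamma))
```

Under these assumptions the scaling gap is zero up to rounding and the hazard at zero reproduces the rate $\mu = 2$.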

6.2 Characterization 

The Tweedie models may be characterized as the only exponential dispersion models closed under scale
transformations, cf. Jørgensen (1997, p. 128). We now show, by means of Proposition 3.1, that the extreme
value distributions satisfy a similar property.

Theorem 6.1 Let $\mathrm{XD}(\mu, \lambda)$ be such that for some $\lambda > 0$ and all $\mu, c > 0$

\[ c\,\mathrm{XD}(\mu, \lambda) = \mathrm{XD}(c^{-1}\mu, g_\lambda(c)) \tag{28} \]

for some positive function $g_\lambda(c)$. Then $\mathrm{XD}(\mu, \lambda)$ is a generalized extreme value distribution.

Proof: First note that the rate $c^{-1}\mu$ on the right-hand side of (28) is consistent with (4). Since $c > 0$ is
arbitrary this in turn implies that $\Psi = \mathbb{R}_+$. Without loss of generality we may take $\lambda = 1$. Calculating
the slope function on both sides of (28) gives

\[ c^{-2} v(\mu) = \frac{1}{g_1(c)}\, v(c^{-1}\mu). \]

This implies that $v$ satisfies the functional equation $v(x)v(y) = v(1)v(xy)$ for $x, y > 0$. Using the
continuity of $v$, the solution is $v(\mu) = c_p \mu^p$, where $p \in \mathbb{R}$ and $c_p$ is an arbitrary non-zero constant that
may depend on $p$. When $p \ne 2$ and $c_p = 1/(2-p)$ this characterizes the generalized extreme value
distribution $\mathrm{EV}_\gamma(\mu, \lambda)$. Other choices for $c_p$ with the same sign correspond to a scale change. Changing
the sign to $c_p = -1/(2-p)$ is possible only for a right censored survival function (Proposition 5.1), which
is incompatible with the condition $\Psi = \mathbb{R}_+$. The case $p = 2$, which has been dealt with in Section 5, is
likewise not compatible with the condition $\Psi = \mathbb{R}_+$. It easily follows that $g_\lambda(c) = \lambda c^{2-p}$, in agreement with
(27). ■



7 Convergence of extremes 
7.1 General convergence theorem 

The results of the previous section show that the Tweedie and generalized extreme value distributions
share certain properties due to the common form of their variance and slope functions. We shall now
complete this analogy by showing a convergence theorem for slope functions, which in turn leads to a
new proof of the extreme value convergence theorem, along the same lines as the Tweedie convergence
theorem of Jørgensen et al. (1994).

The use of variance functions for proving convergence for natural exponential families was initiated by
Morris (1982), but a rigorous formulation and proof was first given by Mora (1990). The convergence
theorem for variance functions says that if a sequence of variance functions converges uniformly on compact
sets, then the corresponding sequence of natural exponential families converges to the family corresponding
to the limiting variance function. We have the following analogous result for slope functions. The
proof is given in the Appendix.






Theorem 7.1 Let $v_n$, $\Psi_n = (\underline{\eta}_n, \overline{\eta}_n)$ be a sequence of slope functions and their respective domains, all
IFR (DFR), such that $\Psi = \mathrm{int}\,(\lim_{n \to \infty} \Psi_n)$ exists and is non-empty, where $\lim_{n \to \infty} \Psi_n$ means that each
of the two sequences of endpoints converges. Assume that $v_n$ converges on $\Psi$, uniformly on compact
subintervals of $\Psi$, to a function $v$ which is strictly positive (strictly negative) on $\Psi$. Assume that $v_n$
satisfies the following left tightness condition. For each $k > 0$ there exists an $\eta \in \Psi$ such that for all $n$

\[ \int_{I_n(\eta)} \frac{\mu}{|v_n(\mu)|}\, d\mu < k, \tag{29} \]

where $I_n(\eta) = (\underline{\eta}_n, \eta)$ ($I_n(\eta) = (\eta, \overline{\eta}_n)$). Then the corresponding sequence of hazard location families
$\mathrm{HL}_n(\mu)$ converges weakly for each $\mu \in \Psi$, uniformly on compact subintervals of the support, to the hazard
location family $\mathrm{HL}(\mu)$ with slope function $v$.

The tightness condition (29) originates from the integral representation (43) of $H_n$ used in the Appendix.
The following remark shows that a similar condition is useful for determining if the limiting family is
right censored.



Remark 7.1 Under the assumptions of Theorem 7.1 we consider the following right tightness condition.
For each $k > 0$ there exists an $\eta \in \Psi$ such that for all $n$

\[ \int_{I_n(\eta)} \frac{\mu}{|v_n(\mu)|}\, d\mu > k, \]

where $I_n(\eta) = (\eta, \overline{\eta}_n)$ (IFR case) or $I_n(\eta) = (\underline{\eta}_n, \eta)$ (DFR case). Then for every $k > 0$ there exists
a $c \in C$ such that $H_n(c) > k$ for all $n$. This, in turn, implies that $H(c) = \lim H_n(c) \ge k$, and hence
$H(b-) = \infty$. This implies no right censoring, so in particular the limiting distribution is proper.

7.2 Extreme convergence theorem 

Let $\gamma \ne -1$ be given, and let $p = p(\gamma) \ne 2$, according to (26). Choosing $c$ in the scaling formula (27)
such that $n = c^{2-p}$ is an integer, we obtain

\[ n^{1/(p-2)}\,\mathrm{EV}_\gamma(n^{1/(p-2)}\mu, n\lambda) = \mathrm{EV}_\gamma(\mu, \lambda). \tag{30} \]

Now recall the min reproductive property (20), by which the left-hand side of (30) represents a centering
and scaling of the scaled min $Y_n$ for a sample of size $n$ from $\mathrm{EV}_\gamma(\mu, \lambda)$. In effect (30) represents the
so-called stability postulate for the limiting distribution of extremes, cf. Kotz and Nadarajah (2002, p. 5).
The corresponding domains of attraction correspond to the classical extreme convergence result, which
in the present setup takes the following form.

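Theorem 7.2 Let $\mathrm{XD}(\mu, \lambda)$ be an extreme dispersion model having unit slope function $v$ with power
asymptotics of the form

\[ v(\mu) \sim \frac{1}{2-p}\,\mu^p \tag{31} \]

as $\mu \downarrow 0$ (IFR case with $p < 2$) or $\mu \to \infty$ (DFR case with $p > 2$). Then for any $\mu, \lambda > 0$

\[ n^{1/(p-2)}\,\mathrm{XD}(n^{1/(p-2)}\mu, n\lambda) \xrightarrow{w} \mathrm{EV}_\gamma(\mu, \lambda) \quad \text{as } n \to \infty, \tag{32} \]

where $\xrightarrow{w}$ denotes weak convergence.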

Proof: Consider the IFR case $p < 2$, where the power asymptotics holds near 0. For fixed values of $\lambda$
and $n$, the left-hand side of (32) is a hazard location family with slope function

\[ \frac{1}{\lambda n^{p/(p-2)}}\, v(n^{1/(p-2)}\mu) \to \frac{1}{\lambda(2-p)}\,\mu^p \quad \text{as } n \to \infty, \]

where we have used the scaling property of the slope. The pointwise convergence follows from (31).
To show that the convergence is uniform in $\mu$ on compact subsets of $\mathbb{R}_+$, let $0 < \mu < m$ for given $m > 0$.
For given $\epsilon > 0$ let $\mu_0$ be such that

\[ \left| \frac{v(\mu)}{\mu^p} - \frac{1}{2-p} \right| < \epsilon \]

for $\mu < \mu_0$, by the assumption of power asymptotics. Then for any $n$ large enough to make $n^{1/(p-2)} <
\mu_0/m$ we find

\[ \left| \frac{v(n^{1/(p-2)}\mu)}{n^{p/(p-2)}\mu^p} - \frac{1}{2-p} \right|
 = \left| \frac{v(n^{1/(p-2)}\mu)}{(\mu n^{1/(p-2)})^p} - \frac{1}{2-p} \right| < \epsilon \]

for all $\mu < m$, which shows the uniform convergence. Since we are in the IFR case, the tightness condition
(29) involves the integral

\[ \int_0^{\eta} \frac{\mu\, n^{p/(p-2)}}{v(n^{1/(p-2)}\mu)}\, d\mu. \]

Since the integrand behaves asymptotically like the power $\mu^{1-p}$, which is integrable on $(0, \eta)$ for $p < 2$,
the tightness condition is satisfied. The convergence (32) hence follows from Theorem 7.1. The proof in
the DFR case $p > 2$ is similar. ■
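
The uniform convergence of the rescaled slope function can be illustrated numerically. The sketch below takes $\lambda = 1$ and, as an assumed test case, the slope function $v(\mu) = \mu(1-\mu)$ (derived here for the logistic survival function $1/(1+e^y)$, with $p = 1$); the helper names, the grid and the interval $(0, 2]$ are hypothetical choices.

```python
def rescaled_slope(v, p, n, mu):
    # Unit slope of n^(1/(p-2)) XD(n^(1/(p-2)) mu, n):
    # n^(-p/(p-2)) * v(n^(1/(p-2)) * mu).
    return n ** (-p / (p - 2.0)) * v(n ** (1.0 / (p - 2.0)) * mu)

def sup_gap(v, p, n, m=2.0, grid=200):
    # Largest gap to the limit mu^p/(2-p) over a grid in (0, m].
    gap = 0.0
    for i in range(1, grid + 1):
        mu = m * i / grid
        limit = mu ** p / (2.0 - p)
        gap = max(gap, abs(rescaled_slope(v, p, n, mu) - limit))
    return gap

def logistic_v(mu):
    # Assumed test case: slope function mu*(1 - mu), power asymptotics with p = 1.
    return mu * (1.0 - mu)

gaps = [sup_gap(logistic_v, 1.0, n) for n in (10, 100, 1000)]
print(gaps)
```

For this test case the rescaled slope is $\mu - \mu^2/n$, so the gap on $(0, m]$ is $m^2/n$ and shrinks at rate $1/n$.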



Compared with conventional extreme value results, the framework of Theorem 7.2 is very convenient,
albeit under the rather strong conditions of differentiability of the density, and monotone hazard rate.
We note that the condition (31) seamlessly integrates the Gumbel case ($p = 1$) with the rest, whereas
the exponential case is not included, see Remark 7.3.

We note that the approach leads to the new centering constant $\theta = -h^{-1}(n^{1/(p-2)}\mu)$, the location
parameter appearing in (18), which corresponds to keeping the rate constant at the value $\mu$ throughout
the convergence (32).

In simple cases, like in Table 1, it is very easy to read off the asymptotic behaviour of the slope
function. For example, the asymptotic behaviour of $v$ near 0 for the logistic and negative exponential
distributions is $v(\mu) \sim \mu$, so both are in the domain of attraction of the Gumbel distribution. Further
examples are considered below.
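
The Gumbel-type behaviour $v(\mu) \sim \mu$ near 0 can be read off numerically as well. The sketch below assumes the survival functions $1/(1+e^y)$ (logistic) and $1 - e^y$ for $y < 0$ (negative exponential); these parametrizations and the function names are assumptions of the sketch rather than quotations from Table 1.

```python
import math

def logistic_hazard(y):
    # Survival 1/(1 + e^y): hazard e^y/(1 + e^y).
    return math.exp(y) / (1.0 + math.exp(y))

def negexp_hazard(y):
    # Survival 1 - e^y for y < 0: hazard e^y/(1 - e^y).
    return math.exp(y) / (1.0 - math.exp(y))

def slope_over_rate(hazard, y, eps=1e-6):
    # v(mu)/mu at mu = h(y), with h'(y) from a central difference.
    dh = (hazard(y + eps) - hazard(y - eps)) / (2 * eps)
    return dh / hazard(y)

for y in (-2.0, -5.0, -10.0):            # rate mu = h(y) tends to 0 as y decreases
    print(y, slope_over_rate(logistic_hazard, y), slope_over_rate(negexp_hazard, y))
```

Both ratios approach 1 as the rate tends to 0, consistent with power asymptotics (31) for $p = 1$.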

Remark 7.2 The von Mises conditions are sufficient conditions involving the density $f$ for extreme value
convergence. For $G \in \mathcal{G}$ with support $(a, b)$, the version of the Gumbel condition proposed by Falk and
Marohn (1993) is (keeping in mind that we use min rather than max)

\[ \lim_{y \downarrow a} \frac{f(y)}{1 - G(y)} = c \tag{33} \]

for some $c > 0$. By l'Hospital's rule this is equivalent to

\[ \lim_{y \downarrow a} \frac{h'(y)}{h(y)} = c. \]

By inserting $y = h^{-1}(\mu)$, we find that (33) is equivalent to (31) with $p = 1$. The situation for $p \ne 1$ is,
however, less clear. For $a = 0$ the corresponding condition is

\[ \lim_{y \downarrow 0} \frac{y f(y)}{1 - G(y)} = -\gamma^{-1} > 0, \]

or equivalently, with an application of l'Hospital's rule,

\[ \lim_{y \downarrow 0} \frac{y h'(y)}{h(y)} = -1 - \gamma^{-1}. \]

This condition apparently cannot be expressed conveniently in terms of the slope function $v$.

Remark 7.3 Contrary to conventional extreme value convergence theory, our framework separates out
the case of exponential convergence, and Proposition ^. l\ illustrates how exponential convergence is prompted 
by left truncation, see also (GJ). The uniform and Pareto examples from Table 1 illustrate that
distributions in the domain of attraction of the exponential distribution have incomplete $\Psi$, with $\inf \Psi > 0$ (IFR
case) or $\sup \Psi < \infty$ (DFR case). In the case of the uniform distribution with the exponential component
removed (22), the new slope function $(1 + \mu)^2$ satisfies (31) with $p = 0$, and so is in the domain of
attraction of the Rayleigh distribution.

Jørgensen and Martínez (1997) developed Tauberian methods for variance functions, where power
asymptotics for $V$ is replaced by regular variation. This could be developed in the present setting along
the lines of de Haan (1970), but is outside the scope of the present paper.

7.3 Examples 

Let us consider two further examples of extreme value convergence that illustrate Theorem 7.2. First
we consider the negative Pareto distribution with survival function $G(y) = 1 - (1 - y)^{-1}$ for $y < 0$.
Straightforward calculations show that the corresponding slope function is

\[ v(\mu) = \mu\sqrt{\mu^2 + 4\mu} \quad \text{for } \mu > 0, \tag{34} \]

which behaves like $2\mu^{3/2}$ near 0. Letting $\mathrm{XD}(\mu, \lambda)$ denote the extreme dispersion model corresponding
to $G$, an application of Theorem 7.2 yields Fréchet convergence,

\[ n^{-2}\,\mathrm{XD}(n^{-2}\mu, n\lambda) \xrightarrow{w} \mathrm{EV}_1(\mu, \lambda) \quad \text{as } n \to \infty. \]

It is worth noting that $v$ in (34) is of the so-called Letac form (Jørgensen, 1997, pp. 157-158), a class of
variance functions that has been extensively studied, see e.g. Kokonendji (1994).
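
The negative Pareto slope function can be verified numerically. The sketch below assumes the closed form $v(\mu) = \mu\sqrt{\mu^2 + 4\mu}$ for (34) and uses the hazard $h(y) = 1/((1-y)(-y))$, derived here from $G(y) = 1 - (1-y)^{-1}$; the function names are hypothetical.

```python
import math

def neg_pareto_hazard(y):
    # G(y) = 1 - 1/(1 - y), y < 0, has density (1 - y)^(-2),
    # hence hazard 1/((1 - y) * (-y)).
    return 1.0 / ((1.0 - y) * (-y))

def neg_pareto_slope(mu):
    # Assumed closed form of (34).
    return mu * math.sqrt(mu * mu + 4.0 * mu)

def slope_gap(y, eps=1e-6):
    # |h'(y) - v(h(y))| with h' from a central difference.
    dh = (neg_pareto_hazard(y + eps) - neg_pareto_hazard(y - eps)) / (2 * eps)
    return abs(dh - neg_pareto_slope(neg_pareto_hazard(y)))

def ratio_to_power(mu):
    # v(mu) / (2 mu^(3/2)), which should tend to 1 as mu -> 0.
    return neg_pareto_slope(mu) / (2.0 * mu ** 1.5)

print(slope_gap(-1.0), slope_gap(-3.0))
print([ratio_to_power(mu) for mu in (1.0, 1e-2, 1e-4)])
```

The ratio equals $\sqrt{\mu + 4}/2$, so it tends to 1 near 0, matching (31) with $p = 3/2$ and hence $\gamma = 1$.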

Next, we consider the Burr distribution with survival function $G(y) = (1 + y^a)^{-1}$ for $y > 0$, for some
$a > 0$, which is DFR for $0 < a \le 1$. An explicit expression for the slope function may be found in the
case $a = 1/2$, where

\[ v(\mu) = -\mu^2\left(\mu + 2 + \sqrt{\mu^2 + 2\mu}\right) \quad \text{for } \mu > 0. \]

The asymptotic behaviour is $v(\mu) \sim -2\mu^3$ as $\mu \to \infty$. An application of Theorem 7.2 yields Weibull
convergence with $\gamma = -2$,

\[ n\,\mathrm{XD}(n\mu, n\lambda) \xrightarrow{w} \mathrm{EV}_{-2}(\mu, \lambda/2) \quad \text{as } n \to \infty, \]

where $\mathrm{XD}(\mu, \lambda)$ denotes the extreme dispersion model generated by $G$. For $0 < a < 1$ the behaviour of
$v$ is like $-\mu^p$ with $p = (a - 2)/(a - 1) > 2$, and (32) applies.
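
A similar numerical check applies to the Burr case $a = 1/2$. The sketch below assumes the closed form $v(\mu) = -\mu^2(\mu + 2 + \sqrt{\mu^2 + 2\mu})$ and the hazard $h(y) = 1/(2\sqrt{y}(1 + \sqrt{y}))$, derived here from $G(y) = (1 + \sqrt{y})^{-1}$; the function names are hypothetical.

```python
import math

def burr_half_hazard(y):
    # G(y) = 1/(1 + sqrt(y)), y > 0: hazard 1/(2 sqrt(y) (1 + sqrt(y))).
    s = math.sqrt(y)
    return 1.0 / (2.0 * s * (1.0 + s))

def burr_half_slope(mu):
    # Assumed closed form of the slope function for a = 1/2.
    return -mu * mu * (mu + 2.0 + math.sqrt(mu * mu + 2.0 * mu))

def slope_gap(y, eps=1e-6):
    # |h'(y) - v(h(y))| with h' from a central difference.
    dh = (burr_half_hazard(y + eps) - burr_half_hazard(y - eps)) / (2 * eps)
    return abs(dh - burr_half_slope(burr_half_hazard(y)))

def ratio_to_cubic(mu):
    # v(mu) / (-2 mu^3), which should tend to 1 as mu -> infinity.
    return burr_half_slope(mu) / (-2.0 * mu ** 3)

print(slope_gap(1.0), slope_gap(4.0))
print([ratio_to_cubic(mu) for mu in (1.0, 1e2, 1e4)])
```

The slope is negative throughout (DFR), and the ratio to $-2\mu^3$ approaches 1 for large rates, in line with $p = 3$ and the index $\lambda/2$ in the limit.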

For $a > 1$ the Burr hazard is not monotone, but is for $y$ near 0. Hence by a suitable right censoring,
we obtain an IFR model with asymptotic behaviour for $v$ with $p < 1$. In general Theorem 7.2 may be
applied in this way to models with non-monotone hazard as long as the hazard is monotone near 0.

8 Exponential slope functions 

We now consider characterization and convergence for exponential slope functions, similar to Jørgensen's
(1997, p. 160) characterization of exponential variance functions. These results have independent interest,
since exponential variance functions correspond to natural exponential families generated by extreme
stable distributions with stability index $\alpha = 1$.

8.1 Characterization

Elaborating on the parallel between the Rayleigh and normal distributions, we note that the latter satisfies
the following transformation property:

\[ N(m + \mu, \sigma^2) - m = N(\mu, \sigma^2) \]

for all $m \in \mathbb{R}$, imitating (30), but with multiplication replaced by addition. From (10) we would expect
in the Rayleigh IFR case that the term $-m$ corresponds to a left truncation followed by the removal of
an exponential component. More generally, given $\mathrm{XD}(\mu, \lambda)$ with unit slope function $v$ on $\Psi = \mathbb{R}_+$, we
consider the shift transformation, defined by the following two steps.

1. Left truncation (IFR) or right censoring (DFR), which restricts the domain to $\mu > m$, while
maintaining the slope at $\lambda^{-1}v(\mu)$. This gives rise to an exponential component.

2. Removing the exponential component, giving the rate $\mu > 0$ and slope $\lambda^{-1}v(m + \mu)$.

The result is an extreme dispersion model $\mathrm{XD}_m(\mu, \lambda)$ with unit slope function $v(m + \cdot)$. We now
characterize exponential slope functions as fixed points for the shift transformation.

Theorem 8.1 Let $\mathrm{XD}(\mu, \lambda)$ have unit slope function $v$ and domain $\Psi = \mathbb{R}_+$. If for some $\lambda > 0$ there
exists a positive function $g_\lambda(m)$ such that for all $m, \mu > 0$

\[ \mathrm{XD}_m(\mu, g_\lambda(m)) = \mathrm{XD}(\mu, \lambda), \tag{35} \]

then the unit slope function $v$ is either constant or exponential.

Proof: By calculating the slope on both sides of (35) we obtain the equation

\[ \frac{1}{g_\lambda(m)}\, v(m + \mu) = \lambda^{-1} v(\mu). \]

Without loss of generality we may take $\lambda = 1$. By letting $\mu \downarrow 0$ and using the continuity of $v$, we find
that the limit $v(0+)$ exists, is positive and finite, and $g_1(m) = v(m)/v(0+)$. This, in turn, implies that
$v$ satisfies the functional equation $v(0+)v(m + \mu) = v(m)v(\mu)$ for all $m, \mu > 0$. Taking into account the
continuity of $v$, the solution is

\[ v(\mu) = v(0+)e^{\beta\mu} \tag{36} \]

for some $\beta \in \mathbb{R}$, which in turn implies that (35) holds for all $\lambda > 0$ with $g_\lambda(m) = \lambda e^{\beta m}$. ■
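
The fixed-point property behind Theorem 8.1 can be illustrated at the level of slope functions. The sketch below takes $\lambda = 1$ and checks that $v(m + \mu)/g = v(\mu)$ holds with $g = e^{\beta m}$ for an exponential slope, whereas no single constant $g$ works for a power slope; the parameter values and function names are hypothetical.

```python
import math

def shifted_slope(v, m):
    # Unit slope function v(m + .) of the shifted model XD_m.
    return lambda mu: v(m + mu)

def fixed_point_gap(v, g, m, mus=(0.1, 0.5, 1.0, 3.0)):
    # (35) with lambda = 1 requires v(m + mu)/g = v(mu) for every mu,
    # with g not depending on mu.
    return max(abs(shifted_slope(v, m)(mu) / g - v(mu)) for mu in mus)

beta, v0, m = -0.7, 2.0, 1.3             # hypothetical test values

def exp_slope(mu):
    return v0 * math.exp(beta * mu)

def power_slope(mu):
    return mu ** 1.5

gap_exp = fixed_point_gap(exp_slope, math.exp(beta * m), m)
# For the power slope, calibrate g at mu = 0.1 and observe the mismatch elsewhere.
gap_pow = fixed_point_gap(power_slope, power_slope(m + 0.1) / power_slope(0.1), m)
print(gap_exp, gap_pow)
```

The exponential slope gives a gap at rounding level, while the power slope leaves a gap of order one, as the functional equation in the proof suggests.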

Besides the Rayleigh case ($\beta = 0$) there are two main cases of (36), one IFR and one DFR. The IFR
case has unit slope function $v(\mu) = e^{-\mu}$ for $\mu > 0$, and corresponds to the extreme dispersion model
generated from the survival function

\[ G(y) = e^{y}(1 + y)^{-(1+y)} \quad \text{for } y > 0. \tag{37} \]

The DFR case has unit slope function $v(\mu) = -e^{\mu}$ for $\mu > 0$, and corresponds to the extreme dispersion
model generated from the survival function

\[ G(y) = e^{-y}y^{y} \quad \text{for } 0 < y < 1, \tag{38} \]

which is right censored at 1.
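
Both exponential slope functions can be recovered numerically from the survival functions. The sketch below reads (37) as $G(y) = e^{y}(1+y)^{-(1+y)}$ and (38) as $G(y) = e^{-y}y^{y}$, which is an assumption of the sketch, and computes $h = -(\log G)'$ and $h' = -(\log G)''$ by central differences; the function names are hypothetical.

```python
import math

def log_g_ifr(y):
    # log of (37), read as G(y) = e^y (1 + y)^(-(1 + y)), y > 0.
    return y - (1.0 + y) * math.log(1.0 + y)

def log_g_dfr(y):
    # log of (38), read as G(y) = e^(-y) y^y, 0 < y < 1.
    return -y + y * math.log(y)

def rate_and_slope(log_g, y, eps=1e-4):
    # h = -(log G)' and h' = -(log G)'' by central differences.
    h = -(log_g(y + eps) - log_g(y - eps)) / (2 * eps)
    dh = -(log_g(y + eps) - 2.0 * log_g(y) + log_g(y - eps)) / (eps * eps)
    return h, dh

h_ifr, dh_ifr = rate_and_slope(log_g_ifr, 0.5)
h_dfr, dh_dfr = rate_and_slope(log_g_dfr, 0.5)
print(dh_ifr, math.exp(-h_ifr))          # IFR: h' should match e^{-h}
print(dh_dfr, -math.exp(h_dfr))          # DFR: h' should match -e^{h}
```

Under this reading the hazards are $\log(1+y)$ and $-\log y$ respectively, so the slopes are $e^{-\mu}$ and $-e^{\mu}$.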

Note that by applying a suitable location and scaling operation to the power slope function (25) for
$p > 2$ we obtain

\[ -\left(1 + \frac{\mu}{p}\right)^{p} \to -e^{\mu} \quad \text{for } p \to \infty, \]

which shows that the DFR case of (36) is a limiting case of the generalized extreme value family. A
similar result holds in the IFR case.

8.2 Convergence

We now show a convergence theorem for exponential slope functions, similar to a result for exponential
variance functions (Jørgensen, 1997, p. 164). In effect, the fixed point (35) has a domain of attraction
consisting of models with asymptotically exponential slope functions.

Theorem 8.2 Let $\mathrm{XD}(\mu, \lambda)$ denote an extreme dispersion model with unit slope function $v$ and domain
$\Psi = \mathbb{R}_+$ having exponential asymptotics of the form

\[ v(\mu) \sim c_\beta e^{\beta\mu} \tag{39} \]

for $\mu \to \infty$, where $c_\beta = 1$ for $\beta \le 0$ and $c_\beta = -1$ for $\beta > 0$. Then the shifted model $\mathrm{XD}_m(\mu, \lambda e^{\beta m})$
converges to an extreme dispersion model with exponential slope function for $m \to \infty$.

Proof: The shifted model $\mathrm{XD}_m(\mu, \lambda e^{\beta m})$ has unit slope function

\[ e^{-\beta m}v(m + \mu) \to c_\beta e^{\beta\mu} \quad \text{for } m \to \infty, \tag{40} \]

pointwise for $\mu > 0$. To show that the convergence is uniform in $\mu$ on compact subsets of $\mathbb{R}_+$, let
$0 < \mu < m_0$ for given $m_0 > 0$. For given $\epsilon > 0$ let $\mu_0$ be such that

\[ |e^{-\beta\mu}v(\mu) - c_\beta| < \epsilon \]

for $\mu > \mu_0$, by (39). Then for any $m > \mu_0$ we find

\[ |e^{-\beta m}v(m + \mu) - c_\beta e^{\beta\mu}| = e^{\beta\mu}\,|e^{-\beta(m+\mu)}v(m + \mu) - c_\beta| < (1 + e^{\beta m_0})\,\epsilon \]

for all $\mu < m_0$, showing uniform convergence. The tightness condition (29) involves the integral

\[ \int_{I(\eta)} \frac{\mu}{|e^{-\beta m}v(m + \mu)|}\, d\mu, \]

where the integrand behaves asymptotically like $\mu e^{-\beta\mu}$. For $\beta > 0$ the interval of integration is $(\eta, \infty)$,
whereas for $\beta \le 0$ it is $(0, \eta)$, so in both cases the tightness condition is satisfied. The result now follows
from Theorem 7.1. ■



There are three main cases of (40).

DFR case. Take $\beta = 1$ and let $m$ be such that $n = e^m$ is an integer. We may then write the
convergence as follows:

\[ \mathrm{XD}_{\log n}(\mu, \lambda n) \xrightarrow{w} \mathrm{XD}_-(\mu, \lambda) \quad \text{for } n \to \infty, \tag{41} \]

where $\mathrm{XD}_-(\mu, \lambda)$ is the model generated by (38). The left-hand side of (41) represents a shift
transformation of the scaled min $Y_n$ for a sample of size $n$ from $\mathrm{XD}(\mu, \lambda)$.

Rayleigh case. For $\beta = 0$ we obtain convergence to the Rayleigh distribution,

\[ \mathrm{XD}_m(\mu, \lambda) \xrightarrow{w} \mathrm{EV}_{-1/2}(\mu, \lambda) \quad \text{for } m \to \infty. \]

IFR case. Take $\beta = -1$ and let $t = e^{-m}$. Then

\[ \mathrm{XD}_{-\log t}(\mu, \lambda t) \xrightarrow{w} \mathrm{XD}_+(\mu, \lambda) \quad \text{for } t \downarrow 0, \]

where $\mathrm{XD}_+(\mu, \lambda)$ is the model generated by (37). This in effect involves the asymptotic distribution of
an extremal process $X_t$ for $t \downarrow 0$, much like the infinitely divisible type of convergence of Jørgensen (1997,
p. 149). This follows by noting that to every $\mathrm{XD}(\mu, \lambda)$ model there exists an extremal process $X_t$, in the
sense of Dwass (1964), such that $tX_t \sim \mathrm{XD}(\mu, \lambda t)$.

Appendix: Proof of general convergence theorem



The proof of Theorem 7.1 proceeds along the same lines as Jørgensen's (1997, p. 54) proof of the
convergence theorem for variance functions, which in turn is a simplification of Mora's (1990) proof in the
multivariate case. The idea is to reconstruct the hazard function $h$ from the limiting slope function $v$
using (14), and in turn use the uniform convergence and tightness to show convergence of the sequence
$H_n$.

Let $K$ be a given compact subinterval of $\Psi$. By assumption $\Psi = \mathrm{int}\,(\lim \Psi_n)$, so we may assume that
$K \subset \Psi_n$ from some $n_0$ on. We only need to consider $n > n_0$. Fix a $\mu_0 \in \mathrm{int}\,K$. Let $\psi_n = h_n^{-1}$ denote
the inverse hazard function given by $\psi_n'(\mu) = 1/v_n(\mu)$ on $\Psi_n$ and $\psi_n(\mu_0) = 0$, cf. (14). Let $h_n$, $H_n$ etc.
denote the quantities associated with this parametrization.

Similarly, define $\psi : \Psi \to \mathbb{R}$ by $\psi'(\mu) = 1/v(\mu)$ on $\Psi$ and $\psi(\mu_0) = 0$. Then for $\mu \in K$

\[ |\psi_n'(\mu) - \psi'(\mu)| = \frac{|v_n(\mu) - v(\mu)|}{|v_n(\mu)v(\mu)|}. \tag{42} \]

By the uniform convergence of $v_n(\mu)$ to $v(\mu)$ on $K$, it follows that $v_n(\mu)$ is uniformly bounded on $K$. Since
$v(\mu)$ is bounded on $K$ (and, being continuous and of constant sign, is also bounded away from zero on the
compact set $K$), it follows from (42) and from the uniform convergence of $v_n$ that $\psi_n'(\mu) \to \psi'(\mu)$
uniformly on $K$. This and the fact that $\psi_n(\mu_0) = \psi(\mu_0)$ for all $n$ implies, by a result from Rudin (1976,
Theorem 7.17, p. 152), that $\psi_n(\mu) \to \psi(\mu)$ uniformly on $K$.

Let $C_n = \psi_n(\Psi_n)$ and $C = \psi(\Psi)$. Then $C = \mathrm{int}\,(\lim C_n)$. Let $J = \psi(K) \subset C$ and $J_n = \psi_n(K) \subset C_n$.
Define $h : C \to \Psi$ by $h(y) = \psi^{-1}(y)$. Since $\psi$ is strictly monotone and differentiable, the same is the case
for $h$, and $h(y) > 0$ on $C$ since $\Psi \subseteq \mathbb{R}_+$. Let $\mu \in K$ be given and let $y = \psi(\mu) \in J$ and $y_n = \psi_n(\mu) \in J_n$.
Since $v_n(\mu)$ is uniformly bounded on $K$, there exists an $m$ such that $|v_n(\mu)| < m$ for all $n$ and $\mu \in K$. It
follows that $|h_n'(y)| < m$ for all $y \in J$. Since $\mu = h(y) = h_n(y_n)$ we find, using the mean value theorem,
that

\[ |h_n(y) - h(y)| = |h_n(y) - h_n(y_n)| \le m\,|y - y_n| = m\,|\psi(\mu) - \psi_n(\mu)|. \]

This implies that $h_n(y) \to h(y)$ uniformly in $y \in J$. The above arguments also apply if $J$ is extended to
a larger subinterval of $C$.

In order to invoke the tightness condition (29), we first consider the IFR case. Using (24) we obtain
for $c \in C$

\[ H_n(c) = \int_{\underline{\eta}_n}^{h_n(c)} \frac{\mu}{v_n(\mu)}\, d\mu. \tag{43} \]

For a given $\eta \in \Psi$ we may choose $\epsilon > 0$ and $c \in C$ such that $h(c) + \epsilon < \eta$, and from the convergence of
$h_n(c)$ to $h(c)$ we obtain $\underline{\eta}_n < h_n(c) < \eta$ for $n$ large enough. Together with (29) this implies that for every
$k > 0$ there exists a $c = c(k) \in C$ such that for all $n$

\[ 0 < H_n(c) < k. \tag{44} \]

Since all $H_n$ are increasing we can make $c(k)$ an increasing function of $k$. In the DFR case the inequality
(44) follows similarly by integrating over the interval $(h_n(c), \overline{\eta}_n)$ in (43).
The condition (44) implies that there exists a $c \in C$ such that

\[ \liminf_{n \to \infty} \int_{-\infty}^{c} h_n(x)\, dx = \liminf_{n \to \infty} H_n(c) < \infty. \tag{45} \]

By Fatou's lemma, (45) implies that $\int_{-\infty}^{c} h(x)\, dx < \infty$. We may now define $G$ and $H$ for $y \in \mathbb{R}$ by
$H(y) = \int_{-\infty}^{y} h(x)\, dx$, and $G(y) = \exp\{-H(y)\}$, where we use the conventions discussed in connection
with (1). Then $G$ is a survival function with support $C$, and $H(\inf C) = 0$. Using the above-mentioned
result from Rudin once more, we find that for any given $d$ in $J$, $H_n(y) - H_n(d)$ converges to $H(y) - H(d)$
uniformly in $y \in J$.

To conclude the proof, we choose a $d \in J$, and show that $H_n(d)$ converges to $H(d)$. The tightness
condition (44) implies that, for given $k > 0$ and $c \le c(k)$, $H_n(d)$ satisfies

\[ H_n(d) - H_n(c) < H_n(d) < H_n(d) - H_n(c) + k. \]

We may enlarge $J$ to include $c$. Letting $n \to \infty$ we find that $H_n(d)$ is asymptotically squeezed between
the values $H(d) - H(c)$ and $H(d) - H(c) + k$, which can be made arbitrarily close to $H(d)$ by choosing
$k$ small, and $c$ close to $\inf C$. Hence $H_n(d)$ converges to $H(d)$. It follows that $H_n(y)$ converges to $H(y)$
uniformly in $y \in J$, completing the proof.